K-LM: Knowledge Augmenting in Language Models Within the Scholarly Domain

نویسندگان

چکیده

The use of superior algorithms and complex architectures in language models have successfully imparted human-like abilities to machines for specific tasks. But two significant constraints, the available training data size understanding domain-specific context, hamper pre-trained from optimal reliable performance. A potential solution tackle these limitations is equip with domain knowledge. While commonly adopted techniques Knowledge Graphs Embeddings (KGEs) inject knowledge, we provide a Language Model (K-LM) Resource Description Framework (RDF) triples directly, extracted world knowledge bases. proposed model works conjunction Generative Pretrained Transformer (GPT-2) Bidirectional Encoder Representations Transformers (BERT) uses well-defined pipeline select, categorize, filter RDF triples. In addition, introduce heuristic methods K-LM, leveraging graphs (KGs). We tested our approaches on classification task within scholarly using KGs, results show that has significantly outperformed baselines BERT each KG. Our experimental findings also help us conclude importance relevance KG used over quantity injected Also, injecting increased overall model’s accuracy, demonstrating K-LM choice adaptation solve knowledge-driven problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Augmenting Medical Databases with Domain Knowledge

In this paper we discuss a method of linking databases with domain knowledge to provide an extended semantics for use with statistical machine learning and automated discovery programs We focus on the use of data in conjunction with domain knowledge for automated discovery in medical databases and show how an induction program can nd new knowledge in a database by reasoning about classes and re...

متن کامل

Augmenting domain specific UML models with RDF

Models are created and maintained in the context of their problem domain. Inclusion of domain rules and background knowledge by means of profiles can be complicated and demanding. This paper presents an alternative approach based on the Resource Description Framework, which attaches background knowledge and rules as present in ontologies at the fringes of the model, rather than to include them ...

متن کامل

heterogeneity within the orientalist discourse: representation of the orient in womens travelogues and mens paintings

from 1950s onward, new theories and critical approaches burgeoned across humanities. these theories were context-oriented; as a result, the analysis of discursive practices gained significance. thus, social, political, historical and cultural discourses that have been hitherto marginalized and considered inferior to literary texts, were introduced as important texts to be analyzed by critics. o...

Modelling Scholarly Debate: Conceptual Foundations for Knowledge Domain Analysis Technology

Knowledge Domain Analysis (KDA) research investigates computational support for users who desire to understand and/or participate in the scholarly inquiry of a given academic knowledge domain. KDA technology supports this task by allowing users to identify important features of the knowledge domain such as the predominant research topics, the experts in the domain, and the most influential rese...

متن کامل

Augmenting Domain-Specific Thesauri with Knowledge from Wikipedia

We propose a new method for extending a domain-specific thesaurus with valuable information from Wikipedia. The main obstacle is to disambiguate thesaurus concepts to correct Wikipedia articles. Given the concept name, we first identify candidate mappings by analyzing article titles, their redirects and disambiguation pages. Then, for each candidate, we compute a link-based similarity score to ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Access

سال: 2022

ISSN: ['2169-3536']

DOI: https://doi.org/10.1109/access.2022.3201542